Acoustic analysis of pathological voices compressed with MPEG system.
Abstract
The MPEG-1 Layer 3 audio compression scheme, commonly known as mp3, has had a great impact in recent years because it achieves high compression rates while preserving high sound quality. Music and speech samples compressed at high bitrates are perceptually indistinguishable from the originals, but very little was known about how compression affects the acoustic properties of the voice signal. A previous study with normal voices showed high fidelity at high-bitrate compression, both in voice parameters and in the amplitude-frequency spectrum. In the present work, dysphonic voices were tested in two studies. In the first study, spectrograms, long-term average spectra (LTAS), and fast Fourier transform (FFT) spectra of compressed and original samples of running speech were compared. In the second study, intensities, formant frequencies, formant bandwidths, and a multidimensional set of voice parameters were tested in a set of sustained phonations. Results showed that compression at high bitrates (96 and 128 kbps) preserved the relevant acoustic properties of the pathological voices. At lower bitrates, fidelity decreased and some important alterations were introduced. The results of both studies, Gonzalez and Cervera and this paper, open up the possibility of using MPEG compression at high bitrates to store or transmit high-quality speech recordings without altering their acoustic properties.
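As a rough illustration of the LTAS comparison mentioned in the abstract, the sketch below averages windowed power spectra over all frames of a signal and reports the largest per-bin difference, in dB, between two versions of it. It is not the authors' procedure: the function names `ltas_db` and `max_ltas_difference`, the frame and hop sizes, the synthetic two-tone signal, and the 3-point moving average used as a stand-in for a lossy codec are all assumptions made for this example. No actual mp3 encoding is performed.

```python
import cmath
import math


def ltas_db(signal, frame_len=64, hop=32):
    """Long-term average spectrum: mean power per frequency bin, in dB."""
    # Symmetric Hann window to limit spectral leakage.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1
    power = [0.0] * n_bins
    n_frames = 0
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        for k in range(n_bins):
            # Naive DFT of bin k; an FFT library would be used in practice.
            coeff = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                        for n in range(frame_len))
            power[k] += abs(coeff) ** 2
        n_frames += 1
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    # Small floor avoids log10(0) for empty bins.
    return [10 * math.log10(p / n_frames + 1e-12) for p in power]


def max_ltas_difference(original, processed, frame_len=64, hop=32):
    """Largest absolute per-bin LTAS difference between two signals, in dB."""
    a = ltas_db(original, frame_len, hop)
    b = ltas_db(processed, frame_len, hop)
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical test signal: 500 Hz and 3000 Hz tones sampled at 8 kHz.
FS = 8000
original = [math.sin(2 * math.pi * 500 * n / FS)
            + 0.5 * math.sin(2 * math.pi * 3000 * n / FS)
            for n in range(1024)]

# Stand-in for lossy processing (NOT mp3): a 3-point moving average,
# which attenuates the high-frequency component.
last = len(original) - 1
smoothed = [(original[max(i - 1, 0)] + original[i] + original[min(i + 1, last)]) / 3
            for i in range(len(original))]

print(f"max LTAS difference: {max_ltas_difference(original, smoothed):.1f} dB")
```

With real recordings, `original` and `smoothed` would be replaced by the decoded samples of the uncompressed and compressed files, time-aligned and at the same sampling rate.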
Similar resources
A computer system for acoustic analysis of pathological voices and laryngeal diseases screening.
A system for acoustic analysis of pathological voices is proposed. The vocalized part of the voice signal is separated and all glottal cycles are traced by means of a cross-correlation detector. From the beginning and duration of each glottal cycle determined in this way, shimmer, jitter, several harmonics-to-noise ratios and other widely used acoustic parameters are calculated. New parameters are a...
On the Use of the Correlation between Acoustic Descriptors for the Normal/Pathological Voices Discrimination
This paper presents an analysis system aiming at discriminating between normal and pathological voices. Compared to the literature on voice pathology assessment, it is characterised by two aspects. First, the system is based on features inspired by voice pathology assessment and music information retrieval. Second, the distinction between normal and pathological voices is simply based on the correl...
Does it affect feature "sex" on automatic detection of impaired voices?
Voice registers are widely affected when voice diseases appear. These diseases have to be diagnosed and treated at an early stage. Detection of voice diseases may be carried out by means of acoustic analysis of the voice register. Many algorithms to calculate acoustic parameters have been developed, and it has been demonstrated that there is a great correlation between parameter deviations and imp...
Voice pathology detection and classification using MPEG-7 audio low-level features
In this paper, a new pathological voice detection and pathology classification method based on MPEG-7 audio low-level features is proposed. MPEG-7 features were originally used for multimedia indexing, which includes both video and audio. Indexing is related to event detection, and as pathological voice is a separate event from normal voice, we show that MPEG-7 audio low-level features can do ver...
Dysphonia is beautiful: A perceptual and acoustic analysis of vocal roughness
Researchers as well as speech therapists are interested in determining reliable acoustic cues that may be useful for the evaluation of vocal quality as well as for the diagnosis of vocal pathologies and remediation. In this way, experimental phonetics can be useful to clinical practice. This work, which tries to connect phonetics and logopedic science, deals with the esthetic quality of...
Journal title:
- Journal of voice : official journal of the Voice Foundation
Volume 17, Issue 2
Pages: -
Publication date: 2003